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Abstract 

The MRFM device is a powerful setup for manipulating single electron spin in resonance 
in a magnetic field. However, the real time observation of a resonating spin is still an issue 
because of the very low SNR of the output signal. This paper investigates the usability and the 
efficiency of sequential detection schemes (the Sequential Probability Ratio Test) to decrease 
the required integration time, in comparison to standard fixed time detection schemes. 

1 Introduction 

Magnetic Resonance Force Microscopy (MRFM) is a promising technique for high-resolution non- 
destructive spatial imaging. One of the most exciting challenge proposed for MRFM is the ob- 
servation of single spin in resonance in a magnetic field. Sidles demonstrated the capability of 
the MRFM for manipulating proton spins [14]. The use of MRFM has since been extended to 
the observation of electron spins through the use of the OScillating Cantilever-driven Adiabatic 
Reversal (OSCAR) method [11, 16]. Although micro-size ensembles of electron spins have been 
detected [21], generating forces as low as 8 x 10 -10 Newton [12], observing a single electron spin in 
resonance is still an issue because of the weakness of the signal. For the current signal-to-noise ratio 
(SNR), the required integration time for detection is too long to allow a real-time implementation. 
The integration time is expected to decrease as technological advances occur improving cantilever 
sensitivity to single spins and decreasing noise sensitivity. Improvements can also be obtained by 
making use of advanced signal processing techniques. This is the focus of this paper. 

Currently, the presence of one (or several) electron spins in resonance is detected by standard 
methods of statistical detection. A statistic of an observed sample of fixed length is compared to 
a threshold. The setting of this threshold splits the space of the statistic into two decision regions. 
A level of confidence (i.e. a probability of error) is associated to each decision regions. The prob- 
ability of error decreases when the observation time (the number of data) increases. In order to 
reach an acceptable level of confidence, the observation time is currently of the order of eight hours. 

Sequential analysis was introduced for generic hypothesis testing problems by Wald in 1947 [17] 
to make decision with a reduced amount of data. This is of great interest for instance in clinical 
trials where ethical considerations require making decision as soon as possible [2], [15]. The most 
important feature of Wald's procedure is that the number of data required to make a decision is a 
random variable. The borders of the decision regions (and thus the probabilities of error) depend 
on this random variable through its expected value called the Average Sample Number (ASN). 
The first Sequential Probability Ratio Test (SPRT), proposed by Wald was designed to test a 
simple hypothesis Hq versus a simple alternative H\. Data are recorded and tested sequentially 
until a condition on the likelihood ratio to accept one of the hypotheses H is met. The ASN of 
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the SPRT is smaller than the amount of data required for any fixed sample size test to achieve the 
same decision error probabilities. 

However, the SPRT presents two main drawbacks. Practically speaking, the absence of any 
upper bound on the stopping time may make the ASN higher than the actual number of data 
available. Moreover, if there is a mismatch of the Hq or Hi models and the data, the expected 
stopping time may be large and, consequently, a sequential procedure may not improve on a fixed 
sample size procedure [4]. Much work has been done to avoid such shortcomings. The main feature 
of the modified SPRT proposed in the literature is to introduce a bound on the stopping time [1] , 
[2]. This led to a class of test called Truncated Sequential Probability Ratio Test (TSPRT). When 
performing a TSPRT, a decision is taken at a given sample size N even if neither of the stopping 
conditions has been met before N. Such modification increases the error probabilities. Another 
way to deal with the uncertainty and mismatch on the hypotheses is to take into account a priori 
information by means of the Bayes formalism [10]. Many monographs have been published since 
the early book of Wald. Most of them adopt a probabilistic approach [15], [19], by focussing on 
the error probability aspect. Wijsman's approach in [20] is slightly different; since it is common to 
consider sequential procedures by means of Brownian Motion, his analysis of sequential test (and 
more generally sequential procedures) relies on elements of the theory of diffusion which makes it 
an original introduction to sequential analysis. 

The aim of this paper is to investigate the usability of the SPRT in the specific problem of 
detecting a single electron spin in the OSCAR setting. We focus on two sequential tests of the 
variance of the observed signal, namely a x 2 and a Fisher-F test. The main contribution of this 
paper is to derive the exact expression of the ASN of the \ 2 test and a low snr development of the 
ASN of the Fisher-F test. These expressions allow a comparison to the number of data required 
to perform the corresponding fixed sample size tests. A procedure is also proposed to perform an 
experimental validation of this comparison in the case of the x 2 test. 

2 Data processing and fixed sample size detection scheme 

We address the problem of detecting an electron in resonance in the OSCAR experiment. Before 
performing the detection procedure, the data are pre-processed in order to enhance the perfor- 
mances of the detector. In this section we briefly describe the OSCAR setup and the associated 
signal processing. 

2.1 Data models 
2.1.1 General outline 
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Figure 1: The OSCAR setup. 



In the OSCAR experiment, presented in Figure 1, a sample is embedded in a Radio- Frequency 
(RF) magnetic field B\. If the RF-field frequency matches the Larmor frequency of an electron 
in the sample, the electron spin is in magnetic resonance [5]. A cantilever with a ferromagnct 
on its tip is settled close to the sample. The cantilever is forced into mechanical oscillations at 
frequency ojq which induces an oscillating magnetic field. As a consequence, the spin polarity of 



any free electron in the resonant slice of the device is forced to reverse synchronously with the 
ferromagnet motion. Moreover, in the so-called interrupted OSCAR experiment, the RF-field is 
turned off every T s ki P seconds so that the spin polarity is reversed periodically. The successive 
steps of the pre-processing are presented on Figure 2. 

Output of the interferometer. The spin reversal induces a slight change in the cantilever 
stiffness. The electron spin can thus be detected by observing a shift Sluq in the natural frequency 
of the cantilever. The motion of the cantilever is measured by a laser interferometer. When a spin 
is resonating in the resonant slice, the output of the interferometer is a frequency-modulated signal 
of form: 

z(t) = A cos{ujQt + / s(u)du + cj)}, (1) 
Jo 

where A is the amplitude of the oscillation, is a random phase and s(u) is a square- wave of period 
2T s ki P and magnitude Sojq. When no electron in the resonant slice is resonating, there is no shift 
in the natural frequency and z(t) = Acos{tuot + <f\. 

Spin relaxation. The spin can spontaneously go out of alignment with the magnetic field. 
This phenomenon called spin relaxation is not fully understood. One model for the effect of spin 
relaxation is that when relaxation occurs, the polarity of the spin changes. This causes A random 
flips per second. The relaxation phenomenon is taken into account in the modelling of the output of 
the cantilever by means of a random telegraph signal. In continuous time, the number of random 
flips is considered as following a Poisson distribution with rate AT where T is the duration of 
the observation. The equivalent discrete-time model is a 2-state Markov chain. If the transition 
probabilities p between states are equal, then p = 1 — T S A where T s is the sampling rate. Other 
models include random walks on the sphere and random walks on the interval. See [16] for a review 
of these different models. 

Frequency demodulation and sampling. The square-wave is estimated by demodulating the 
signal with a frequency lock-in device. Then the output of the frequency lock-in is sampled. 
The frequency lock-in can be seen as a frequency estimator whose variance induces an additive 
noise. Considerations on the dynamical system describing the spin-cantilever interaction lead to 
a natural alternative which makes use of an efficient estimator of the square-wave related to the 
MUSIC algorithm [6]. 

Output of the correlator and filtering. In order to remove the deterministic square-wave, 
the discrete signal is correlated to a reference square- wave of period 2T s ki P resulting in a baseband 
signal with the natural frequency loq removed. This is the so-called in phase filtering of the output 
of the frequency estimator. 

The signal-to-noise ratio is increased by filtering the sampled data over the pass-band of the spin 
signal with a low-pass filter defined by the recursive relation: 

1 — a . . , . 

x n = aac n _i H — (z n + z n -i), (2) 
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Figure 2: The pre-processing scheme. 



where z n is the input of the filter and x n , the output. The cut-off frequency uj c of this recursive 
filter is set by parameter a: 

1 - sinjcjj 

a= — — . (3) 

COS{w c } 

This filtering induces a coloration of the embedding noise. In order to consider that the assumption 
on independence of the data samples is valid, the output of the filter is subsampled at the rate 
2tt/lo c . 

2.1.2 SNR estimation 

A noise-alone reference can be generated by correlating the demodulated signal with a version of 
the reference square- wave phase-shifted by 90°. This is the so-called quadrature filtering. The 
ratio of the energies of the output of the in-phase channel to the output of the quadrature channel 
provides an estimate of the signal-to-noise ratio. Under an i.i.d. Gaussian assumption on the 
sampled demodulated cantilever signal, this ratio is by definition a Fisher-F random variable. It 
will be used as the test statistic for the so-called Fisher-F test that we describe in this paper. 
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Figure 3: The SNR estimation scheme. 



2.2 The energy detector. 

The single spin detection problem can be formulated as making a decision between the two hy- 
potheses: 

Hq ■ %n — ^ni f a\ 

Hx : x n =d n + v n , 

where d n is a random signal with RMS amplitude a<i and intensity A and v n is a white Gaussian 
noise with zero-mean and variance a 2 ,. When the SNR enja 2 is sufficiently small we can consider 
x n as a Gaussian random variable with zero-mean and variance a 2 + a 2 , under H\ . In this case the 
single spin test consists in deciding: 

Hi: x n ~Af(0,cr 2 u +(T 2 d ), 



(5) 



where X ~ Y means that the random variable X has the same PDF as the random variable Y. 
The detection procedure is applied to the energy E% = J2n=i x n- The energy Ex under both 
hypotheses are sums of N independent identically distributed squared Gaussian variables. They 
are random variables following x 2 distributions with N degrees of freedom and scale parameter 
equal to the variance of the Gaussian variables. The hypotheses (5) can be equivalently formulated 
as: 

The Probability Density Function (PDF) of a o 2 X% random variable with scale parameter er 2 and 
df degrees of freedom is of the form: 

f 2 2 ( X ) = - df/2-l -x/2a 2 (7) 

where T(u) is the gamma function. 



2.3 The Fisher-F test 



When of is unknown, the x 2_ test derived from the energy detector can no longer be applied. An 
alternative detection procedure is based on the ratio of the energies of the quadrature and in-phase 
channel components. Under the hypotheses given by (4), the models for the in-phase channel 
component x l n and for the quadrature component channel x\ are: 

H : 4 ~ N(0,o*) and x% ~ Af(0, a*), (] 

H x : 4 ~ M(Q, al + of) and x% ~ J\f(0, ol). K > 

The detection procedure is applied to the ratio of the energies: 

RW=E^/E$\ (9) 

As already mentioned in the case of the energy detector, under the Gaussian assumption on x the 
energies are x 2 distributed with N degrees of freedom and scale parameter equal to the variance 
of the Gaussian variables. The ratio of the \ 2 variables is a Fisher-F random variable denoted 
o 2 T(N, N) with N and N degrees of freedom and scale parameter o 2 equal to the ratio of the 
scale parameters of the \ 2 variables. Thus the Fisher-F detector can be formulated as testing the 
two hypotheses: 

f Ho : RW~F(N,N), 

\ Hi : Ri N) ~ (1 + snr)J 7 (N, N), 1 ' 

where the random variable o 2 !F(df, df) has a Fisher-F PDF of the form: 

i r(df) {x/o 2 r' 2 -^ 

Wk*)W - ^r(df/2) 2 (i + x/o 2 Yf 



2.4 Likelihood Ratio Tests 
2.4.1 Generality 

Given a random variable X having a PDF /e(x) defined by the parameter 9, the likelihood function 
1(8; x) of a sample x of X is defined as 1(6; x) = fe(x). The Likelihood Ratio Test (LRT) consists 
in testing simple hypothesis Hq under which X follows /o (x) = fg (x) versus the simple alternative 
Hi under which X follows /i(x) = /^(x) by setting a threshold r on the likelihood ratio statistic: 

If A(x) > t hypothesis Hq is rejected and vice-versa. The notation: 



Hi 

A(x) ^ t, (13) 



is often adopted to evoke the LRT. 



Error probabilities of the test The LRT is intended at taking a decision given a finite sample 
of a random variable. The performance of a test is evaluated by the decision error probabilities. 
The probability of miss P m is (also called the probability of error of first kind) is the probability of 
deciding Hq when Hi is true and the probability of false alarm Pj a (also called the probability of 
error of second kind) is the probability of deciding Hi when Hq is true. The threshold r determines 
the error probabilities. An ideal test would require to choose r such that both error probabilities 
tend to zero. Unfortunately the probability of miss increases when the probability of false alarm 
decreases. The user has to decide which one of the errors is preferable. However, the LRT is often 
referred as the optimal test in the sense that for a given Pf a , the LRT provides the smallest P m is- 



Decision regions The LRT splits the sample space into two exclusive regions of acceptance of 
Ho or Hi. For fixed sample size N, the boundary between the decision regions depends on the 
probabilities of error through the setting of r and N. Thus, given a sample size N and a threshold 
t, Pmis and Pf a are uniquely defined. 



2.4.2 The Energy detector 

For the LRT involving the likelihood function (7) of the energy E% , the test statistic (12) takes 
the form: 

A (W) (x) = (^-Y^ eM^T^4 N) }, (14) 
\ 1 + snr J 2af, 1 + snr 

where for sake of simplicity in the notations, the dependence on Ex is replaced by a dependence 
on x = x\, . . . ,xn and snr = . It is often more convenient to test the log-likelihood A("'(x) of 
the data: 

AW(x) =log{AW(x)} = -^—±- E W -^l g{l + w}. (15) 

1 + snr 2a£ 2 

One can see that the log-likelihood ratio statistic depends on the energy E^ through parameters 
snr and cr 2 , which are common to both hypotheses. Hence, when testing two normal distributions 
with zero mean and different known variances, the energy detector is equivalent to the LRT. 



2.4.3 The Fisher-F Ratio detector 

The log-likelihood ratio of the energy ratio Ft!x is defined from expression (11) of the Fisher-F 
PDF by: 

hS N) {x) = — log{l + snr} + AHog{l + x] - JVlog{l + snr + x}. (16) 

Unlike the log-likelihood ratio (15) involved in the energy detector, this test statistic depends only 
on the signal-to-noise ratio snr. Knowledge on the noise variance is not required to implement the 
Fisher-F test. This is the reason why this test is currently preferred in the OSCAR experiment. 



2.4.4 Required sample number 

Expressions (6) and (10) show that the parameter of interest for discriminating between the hy- 
potheses is the scale parameter erf. In both cases, the scale parameters are such that a\ = 
(1 + snr)<jQ. The cumulative distribution functions Fi(r) = J fi(u)du of the x 2 and the Fisher-F 
random variables for i — 0, 1 satisfy the relation: 

F 1 (r) = F (— —). (17) 
1 + snr 

The probability of miss can thus be expressed as a function of the probability of detection, param- 
eterized by snr, and the sample size N . The required number of samples can be evaluated for a 
given test with a given strength (P/ a , Pmis) = (cc, (3) by constraining P/ a , and choosing N such 
that Pmis is reached. 



3 Sequential Detection Procedures 

Unlike fixed sample size procedures, the sample size required in a sequential procedure is a random 
number called the sample number. The Average Sample Number (ASN) can be dramatically 
smaller than the corresponding sample size N required to perform a fixed sample size test of at 
least same strength (a, (3) [15, 17]. The Relative Sample Efficiency (RSE) of a sequential procedure 
with respect to a fixed sample size procedure is the ratio A^/ASN. 



3.1 Sequential Probability Ratio Test 



A fixed sample size test splits the sample space into two decision regions. A sequential procedure 
splits the sample space into three regions; the region of acceptance of Hq, the region of acceptance 
of Hi, and the continuation region where the decision is postponed and another sample is acquired. 
More specifically, suppose a decision has to be taken from the likelihood ratio statistic A*-") com- 
puted from the n first available samples x\, x%, . . . , x n . Two constants A and B are chosen such 
that A> B and the SPRT is defined as follows: 

AW < B : accept H , 

B < \( n > < A : postpone the decision, (18) 
> A : accept H 1 

The thresholds A and B define the boundaries between the three decision regions. The stopping 
time or sample number N s is defined by: 

N s = mm{N 0l N 1 }, (19) 

where Nq (resp. Ni) is the first time the likelihood ratio statistic crosses the boundary A (resp. 
B). The probability that such a test terminates is one. Wald and Wolfowitz have shown that 
among all the sequential test, the SPRT provides the smallest ASN under both hypotheses [18]. 
For a desired strength (a, (3) of test, the boundaries are set by the Wald's approximations [9, 17]: 

A = i— - (20a) 
a 

B = — — . (20b) 
1 — a 

These approximations hold for small error probabilities, typically smaller than 0.05. 



Operating Characteristic and Average Sample Number function The design of the SPRT 
assumes that the parameters 9i defining the hypotheses are known. The behavior of the test 
strongly depends on this assumption. In particular, the ASN can dramatically increase if the true 
parameter 9 does not match the hypotheses. The Operating Characteristic L(9) is the probability 
of accepting Ho when the true parameter is 9. In particular, L(9q) = 1 — Pf a and L(9{) = P m i S . 
The operating characteristic is an efficient tool for evaluating the performances of the test under a 
model mismatch. 

Wald shows that the operating characteristic of the SPRT can be approximated by: 

m « ^— gjr, (2i) 

where h is solution of the integral equation: 

r+ °",wM) h}[x ' e)dx=1 ' (22) 

One can see from this equation that h depends on the true parameter 9. For instance the solutions 
in the cases 9 = 9q and 6 = 6\ can be computed by noting that f(x,9) is a PDF: 

h = l if = 0o, 

h = -1 if 9 = 9 X . { > 

In general, h cannot be evaluated explicitly for every 9 and one has to approximate it numerically 
A possible approach suggested by Wald consists of solving equation (22) by finding the value of 9 
for which the equation is verified by a given value of h [17]. 

Wald makes use of the operating characteristic (21) and of the approximations (20) to derive an 
approximation to the ASN E{N\9} required to stop the SPRT when the true parameter is 9: 

F t N W_ miQS{B} + (l-L(9))log{A} 

E{A( X )\9} ' (24) 

where A(x) = A' 1 ) (a;). Unlike the operating characteristic, the ASN depends on the model through 
the expected value of the log-likelihood ratio. 



3.2 The Approximation of the Log-Likelihood Ratio Statistic as a Brow- 
nian Motion 



Most of the properties of the SPRT are model-independent. In particular, the statistical behavior 
of the SPRT can be studied by approximating the sequential log-likelihood ratio statistic as a 
Brownian Motion [15]. 

3.2.1 The Brownian Motion 

Definition of a Brownian Motion. A Brownian Motion (also called a Wiener process) 
W(t) ~ BM{ii, a 2 ), ^ t < oo with drift \i and variance a 2 is a random process such that: 

• W{0) = 0; 

• W(t) - W(s) ~ Af(n(t - s), a 2 (t - s)), for all ^ s <t < oo; 

• for all < si < fa < s 2 < t 2 < oo, W(t) - W(s) ~ Af(/j,(t - s),a 2 (t - s)), the random 
variables W(t\) — W(si) and Wfa) — W(s2) are independent; 

• W(t), ^5 t < oo is a continuous function oft. 

If the mean and variance of A^^x) are linear functions of the sample size N, the log- 
likelihood ratio can be considered as the sampled Brownian Motion W(t) = W(NT S ) = AW(x) - 
BA4(n,a 2 ). Then by equating the first and second moments of under Ho and Hi, one can 
compute the drift /ij and the variance a 2 under hypothesis Hi. The test can then be formulated 
under a new model: 

" H : W(t)~BM(m>,o$), 
Hi: W(t)~BM(pi,o$ 



2, (25) 



where fii and Oi are parameters characterizing the performance of the test procedure. 
The drifts and variances are derived from the first and second order moments of the test statis- 
tic. Thus the approximation of the sequential likelihood ratio statistic as a sampled version of a 
Brownian Motion is valid up to the second order moment. A drawback of such an approximation 
is that the skewness of the distribution is not taken into account. This leads to an overestimation 
of the expected value of the stopping time in many situation. This phenomenon is known as the 
overshooting [20]. The overshooting phenomenon has been evaluated numerically for some tests 
[15]. 

3.2.2 Truncated SPRT and prediction of the sample number 

In many applications, the number of samples available is limited. After a given time, a decision has 
to be made even though the sample still spans the neutral region. Define the truncated Sequential 
Probability Ratio Test (TSPRT) with stopping rules: 

To = min{inf{n: A(») ^log{A}},A^}, 

Ti = min{inf{n : A(") ^ log{B}}, AT}, [ ' 

where N is the maximum practicable sample size. Under both hypotheses, if Tj < A^ then the 
estimated stopping time is A/j = T i: else, thanks to the definition of Brownian Motion, one can 
predict what should be the stopping time if additional samples were to be taken. Indeed, JVj is 
such that 

W{N l T s ) - W(NT S ) ~ MQi(Ni - N)T S , a 2 (N z - N)T S ). (27) 

By noting that W(NT S ) = A^ and W{N t T s ) = \og{A] under Ho and W{N t T s ) = log{B} under 
Hi one can express: 

/ = Ar+J_(log{A}-AW), 



On Figure 4 are presented the histograms of the estimated ASN Eo{Nq} and Ei{N\} for lengths 
A^ = 1000 and A^ = 5000 for the SPRT based on the energy statistic described in next section. One 




Figure 4: Experimental evaluation of the accuracy of the prediction of the required sample number. 
The histograms of the prediction (dashed line) are superimposed to the histograms of the actual 
sample number (plain lines). 



hundred trials have been performed. The strength of the test is Pf a = m %s= 0.02. The signal-to- 
noise ratio is —20dB before filtering. It has been chosen such that for each trial, 1000 < No, N% < 
5000 with high probability. Thus for each trial, when N = 1000 (dashed lines) the sample number 
has to be predicted and when TV = 5000 (plain line), it can be observed. The tail of the true 
histogram is heavier than for the predicted histogram. Indeed, the prediction scheme is based on 
the approximation of the likelihood ratio as a Brownian Motion. The distribution of the predicted 
sample number is closer to a Gaussian distribution. The consequence is a reduction of the skewness. 
Thus like for the approximation (24) proposed by Wald, the ASN is slightly under-estimated by 
approximating the likelihood ratio sequence as a Brownian motion. 



3.3 SPRT based on the energy statistic 

In the case of x 2 ( or Gamma) distributions, Bartholomew [3] and Phatarfod [13] have proposed 
specific formulations of the procedure for application of a sequential test to analysis of the arrival 
time of a random event. These works concern distributions which differ under the null hypothesis 
and the alternative by the number of degrees freedom. We are concerned with distributions which 
present the same number of degrees of freedom, namely, the sample number. 
The expected value and variance of a ct 2 Xn random variable are: 

E{a 2 X %} = No 2 . (29) 
Var{a 2 X %} = 2TV(a 2 ) 2 , (30) 

so, under the hypotheses (6), the test statistic (15) can be approximated by the following Brownian 
motion: 

J H : W (t)^BM(^^-llog{l + snr},^L_), 

\H i: W{t)~BM{^-\\og{l + S nr} lW ^), 1 ! 

Figure 5 displays the ratio of the ASN to the number of sample required for the fixed sample 
LRT with same error probabilities. The ASN have been estimated as the average of the sample 
numbers computed from 100 trials of 20000 points. After filtering the sub-sampling reduces the 
sample size to TV = 250. When the test failed to stop before TV, the prediction procedure described 
in section 3.2 was applied. The number of samples required for the fixed sample size test has 
been computed numerically from the known expression of the error probabilities as described in 
section 2.4.2. 

At higher SNR, the number of samples required for both the sequential test and the standard test 
are of the order of the unit, so the computed ratios cannot be considered as reliable. At low SNR, 
the RSE is around two. The RSE increases when the error probabilities decrease. This makes the 




SPRT particularly attractive when the level of confidence put on the decision is to be high. 
The ratios predicted from Wald's approximation (24) of the ASN are superimposed to the ex- 
perimental ratios. At low error probabilities, one can see that Wald's expression over-estimates 
the true ASN. This phenomenon is associated with the skewness of the PDF (highlighted by Fig- 
ure 4) [15, 19]. It is of interest to notice that when the error probabilities increase (see the case 
Pf a = P m is = 0.15), approximation (24) no longer holds and the ASN tends to be under-estimated. 
However, the improvement is still significant. Above this value, there is no gain in using a sequential 
test. 

3.4 SPRT based on the Fisher-F statistic 

Jackson [7] and Jennison [8] have derived exact values of the operating curve and ASN for the 
sequential Fisher test when the parameter of interest is the mean of a normal population. Like 
Bartholomew in [3] and Phatarfod in [13], Jennison makes use of Cox's theorem to transform the 
observation and apply the test to a new statistic which is independent on the number of degrees of 
freedom. Such approach cannot be adopted in our case as the parameter of interest is the variance 
of the normal population. 

An exact expression of the expected value of the log-likelihood (16) has yet to be derived. We 
propose in the Appendix a derivation of an approximation that holds at low snr. One can see with 
expressions (49) that the expected value of the log-likelihood is not a linear function of the number 
of data. Thus, the approximation as a Brownian Motion is not valid for the Fisher-F test statistic. 
Consequently, the sample size prediction procedure does not hold and the experimental evaluation 
of the ASN cannot be performed. On Figure 6 is presented the theoretical ratio of the ASN under 
Hi to the number of samples required for the fixed sample size test of same strength at low snr 
(— 30dB before filtering). 

When the error probabilities are smaller than 0.06, the SPRT significantly reduces the required 
sample size. A strength of a = f3 = 0.02 can be achieved by a fixed sample size test if the sample 
size is of the order of 10 7 . The use of the SPRT allows a reduction of the order of 10 6 data samples. 
When the error probabilities are above 0.06 the ratio is smaller than 1, which is due to the fact 
that the Wald's approximation is not reliable for large error probabilities. 

The right-hand side of Figure 6 displays the ASN of the Fisher-F test to the ASN of the \ 2 test 
ratio computed from expression (24) and approximation (49) under H and Hi. As far as Wald's 
approximation holds, one can see from expression (24) that this ratio is independent of the error 
probabilities. At low signal-to-noise ratio, the ASN of the Fisher-F test is 4 times the ASN of the 
X 2 test. 



ASN to fixed sample size ratio for the Fisher-F test ASN of the Fisher-F test to ASN of the test ratio 

1 1 1 1 1 1 1 4.25 1 1 1 1 1 1 1 1 r— 




Figure 6: Evaluation of the Fisher-F test: RSE of the Fisher-F test under Hi for various error 
probabilities (left). The snr before filtering is — 30dB. Ratio of the ASN of the Fisher-F test to 
the ASN of the x 2 test (right) uner Ho (dashed line) and under Hi (plain line). 

4 Conclusion 

The advantages of Sequential Probability Ratio Tests (SPRT) for detection in a \ 2 and Fisher-F 
model has been investigated. Evaluation of the relative sample efficiency of the SPRT with respect 
to the corresponding fixed sample size test has been computed on simulation data. 
For these test, it was experimentally verified that a gain of 50% in the required integration time 
can be expected at low SNR (below — 30dB). The appealing feature of the proposed model is that 
the test statistic can be written in a closed form which avoid making use of approximations. 
A similar study has been performed to evaluate the efficiency of a Fisher-F type test based on 
the ratio of the energies of the in-phase and quadrature channel outputs. This test requires only 
the knowledge of the signal-to-noise ratio and can be performed whatever the noise variance. The 
consequence is an increase in the ASN of he Fisher-F test in comparison to the x 2 test. At low 
error probabilities, the ASN is still smaller than the number of samples required to perform a fixed 
sample size Fisher-F test of same strength. 

Another direction required for application to the OSCAR experiment is the evaluation of SPRT 
performances for unknown bandwidth of the spin signal. This will allow SPRT to be developed for 
the filter bank implementation of the OSCAR experimental apparatus. 

Appendix 

Expected value of the statistic of the Fisher test 

The log-likclihood ratio of the Fishcr-F test is of form: 

A (Ar) (x) = — log{l + snr} + ATlog{l + x} — N log{l + snr + x}. (32) 

The expansion into Taylor series of this function around snr = provides the approximation for 
small snr: 

M 

AT sr>r m 
AW(x) = - log{l + snr} + N £ (-1)" - + o(M). (33) 

m— 1 ^ ' 

Thus the expected values of the likelihood ratio under both hypotheses involves the terms £^{1/(1 + 
x) m }. 



Computation of Eo{ m^ys }• Under Ho these terms are written: 

r 1 1 / ,+oc ' 1_ T(7V) x^ 2 ' 1 ^ 

ol (i + x)™ } _ X (i + x)™r(7v/2) 2 (i + x)^ rfa; ' (34) 

T(N) f +oc 1 x^ 1 , 

j^dx. (35) 



T(iV/2) 2 y (l + x) m (1+x) 
After the change of variable y = log(l + x), this term takes the form: 

E n { - | = T(AQ r+°° (N+m _ 1)y _ 1)N /2-i d 

oi (i + xK r(iv/2) 2 J 1 ' y ' 

A m (N,l), (36) 



r(iV/2) 2 

where the integral A m (N, b) is defined by: 



A m (N,b)= C - {N+m - b)y {e y -l) N t 2 - b dy. (37) 



An integration by parts involving the functions: 

a(v) = fe y -lW 2_6+1 

9KV) N/2-b+V ' 

show that under the condition b < n/2, A m (n, b) satisfies the recursive equation: 

A m (N,b) = a m (N,b)A m (N,b-l), (38) 

where a(N, b) = N j^/™Zb+i ■ This equation leads to the relation: 

V6,c A m (N,c)= nb Am{N '%, v (39) 
By noting that for the case b = N/2, A m (N, N/2) = 2/ (N + 2m), expression (39) takes the form: 

Vc A m (N,c)= ^ (40) 

llk=c+l a ( N > k > 

This equality holds for even values of N. It is asymptotically true for the odd values of N. The 
denominator can be expressed in terms of Gamma functions: 

J] a(N,k)- W + m-c+1) 



T(N/2 -c+ l)T(N/2 + m + 1) ' 

and the integral A m (N, b) takes the form: 

r(N/2-b + l)T(N/2 + m ) 
A »^ N > b >- T(N + m-b+l) ■ (41) 

By injecting this expression into (36), the expected value of 1/(1 +x) m under Hq is finally written: 

F f 1 1 = T (N)T(N/2 + m) 
01 (1 + x)™ 1 r(A^/2)r(7V + m) ' 1 ; 



Computation of E\{ m^ys }• The expected value of 1/(1 + x) m under H\ is of forr 



_L T(JV) f +oc 1_ _L {x/(l + snr)*' 2 - 1 

■(l + x) m * ~ T(N/2) 2 J (1 + x) m 1 + snr (1 + ar/(l + snr)) N 



After the change of variable y = x/(l + snr) this expression takes the form: 

1 T(N) f + °° yN/2-l 

El{ TTT^r } = TWWJo h ^ snr) JTT^W dx - (44) 

where h(y, snr) = ^i +y ^]_ snr ^ m ■ An expansion of h(y, snr) into Taylor series around snr = 0: 



leads to the expression: 



U (l + ^) m ~ T(A/2) 2 ^ l ; r(ro) fc! ./„ (1 + y) m + fc (1 + y) N U ' 



where one can recognize the integrals A m (N, 1 — fc) previously defined. Finally the expected value 
of 1/(1 4- x) m under H\ takes the form: 

^4y )trj ^r^ (N - 1 - *>■ (47) 

The expected values of the likelihood ratio statistic take finally the forms: 

£o{AW(z)} « _to g {i + ™ r } + tf_^J r A m (7V,l), (48) 

2~ 



£;i{A (JV) (x)} « ^log{l + snr} 
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